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Spectral sets and if-spectral sets, introduced by John von Neumann in [vNeSlj . offer a 
possibility to estimate the norm of functions of matrices in terms of the sup-norm of the 
function. Examples of such spectral sets include the numerical range or the pscudospcctrum 
of a matrix, discussed in Chapters 16 and 18. Estimating the norm of functions of matrices 
is an essential task in numerous fields of pure and applied mathematics, such as (numerical) 
linear algebra |Gre971 |Hig08| , functional analysis |PauQ2| . and numerical analysis. More 
specific examples include probability [DD99| , semi-groups and existence results for operator- 
valued differential equations, the study of numerical schemes for the time discretization of 
evolution equations [CroOSj . or the convergence rate of GMRES (Section 41.7). 

The notion of spectral sets involves many deep connections between linear algebra, oper- 
ator theory, approximation theory, and complex analysis. One requires for example simple 
criteria such that the closed unit disk, the numerical range, or another given set in the 
complex plane is /^-spectral for a given matrix A. In order to study sharpness in matrix 
norm inequalities, one may look for extremal matrices or operators. It is also of interest 
to consider joint spectral sets of several matrices. How are notions like functional calcu- 
lus, similarity, or dilation related to spectral sets? Is the intersection of spectral sets also 
spectral, and what arc optimal constants? How to apply the theory of spectral sets to the 
approximate computation of matrix functions? The aim of this chapter is to give at least 
partial answers to these questions, and to present a survey of the modern theory of spectral 
and A'-spectral sets for operators and matrices. 



107.1 Matrices and Operators 

Though many of the examples presented below are for matrices, it is more natural to 
present the theory of spectral sets in terms of operators acting on an abstract Hilbert space 
H. Those readers preferring matrices can always think of the Hilbert space C" equipped 
with the usual scalar product. 
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For the convenience of the reader, some few basic properties of operator theory are col- 
lected below. Standard references are the books |RN55[rSNF70tjNik02j . All definitions reduce 
to known ones in the case of matrices. 

Definitions: 

For a complex Hilbert space H with scalar product (•, •) we denote by C{H) the normed 
space of all bounded linear operators on H. The operator norm oi A £ C{H) is defined 
by 1 1 A 1 1 sup{||^a;|| : ||a:;j| = 1}. A contraction is a linear operator whose operator norm 
is not greater than one, while a strict contraction is an operator A such that ||^|| < 1. 
An operator A S C{H) is normal if A commutes with its Hilbert space adjoint A*. The 
operator A is unitary if AA* ^ A*A = I, where / represents the identity operator. The 
operator A is Hermitian, or self-adjoint if A* = A. We say that the operator A E C{H) 
is positive semidefinite, and we write A ^ 0, if {Ax,x) > for each x £ H. 

The spectrum a{A) of A is the set of all complex numbers z such that ^ — 2/ is not 
an invertible operator. The spectral radius p{A) of the operator A is given by p{A) = 
sup{|A| : A <S (j{A)}. The numerical range, or field of values, of an operator A is defined 
by ly(^) := {{Ax,x) : \\x\\ = 1}. The numerical radius oi A E C{H) is defined by 
w{A) sup{|A| : A G VK(^)}. An operator yl is a numerical radius contraction if 
w{A) < 1. 

For s > 0, we say that the operator A E C{H) belongs to the class of Sz.-Nagy and 
Foias if the inequality 2^||^Aa;|p -I- 2^^re((CAx, x)) < ||.t||^ holds true for every x & H 
and every complex number C with \C,\ < 1. The operator radius Ws associated with the 
class Cs may be defined by Ws{A) = inf {r : r > 0, ^A E Cg} ■ 

The following notation will be used throughout this chapter: for a set M C C we denote 
by DM its boundary and by M its closure. D is the open unit disk, T = 9D is the unit circle 
and X always denotes a closed subset of C. 

Definitions: 

TZ{X) and C{X) are the sets of complex- valued bounded rational functions on X, and 
complex-valued bounded continuous functions on X, respectively, equipped with the 
supremum norm = sup{|/(a;)| : x E X}. 

Facts: 

The following facts can be found in jSNF66| ISNFTO] . 

1. We have ||A||/2 < w{A) < \\A\\, and \\A\\/s < Ws{A). See also Chapter 18. 

2. Ws is a norm on C{H) if and only if < s < 2. 

3. lims^oo u)s{A) = p{A) for every A E C{H). 

4. The class Ci is the class of all contractions and u'i(A) = 

5. The class C2 is the class of all numerical radius contractions and W2{A) = 'w{A). 
Examples: 

1. The space €^ of complex square summable sequences x = (xq, xi, 2:2, . . .) with scalar 
product {x,y) = J^JLo ^jVj s-^d norm ||x|| = a/ (x, x) is an example of an infinite 
dimensional Hilbert space. Here the action of a linear operator can be described by 
the matrix product Ax with an infinite matrix A. The shift operator 5* acting on 
i'^ via S{zo, zi, . ■ .) = (0, zq, zi, . . .) has the norm given by ||5|| = 1, the spectrum 
C7{S) = D, and the numerical range W{S) = D. 

2. We will sometimes also make use of Banach spaces like the space £p of complex 

sequences x = (xq, xi, X2, . . .) with norm ||x|| = (X^jlo NjI^)^^^ where again the 
action of bounded linear operators is described by infinite matrices and the matrix- 
vector product. If one considers the subspaces of sequences where only the first n 



Spectral sets 



107-3 



entries are non-zero, then there is a canonical isomorphism with C", equipped with 
the vector Holder p-norm. Similarly, if the infinite matrix A contains non-zero entries 
only in the first n rows and colmims, then the operator norm j|j4||£(^p) coincides with 
the matrix Holder p-norm of the corresponding principal submatrix of order n. 

3. Another example of an infinite dimensional Banach space is the set of complex- 
valued functions defined on some set X, with ||/|| := (^J^\f{z)\P dii{z)'^^^^ for a 
suitable measure fi on X . 

4. Since / € is analytic on a neighborhood of X, we get from the maximum 
principle for analytic functions that 

WfWx = sup{|/(a:)| -.xeX}^ sup{|/(a;)| : x G dX}. 

In particular, TZ{X) can be seen as a subalgebra of C{dX). 



107.2 Basic Properties of Spectral Sets 

Let _ff be a complex Hilbert space and suppose that ^ is a bounded linear operator on H. 
Suppose now that (j{A) is included in the closed set X and that f = p/q & Ti{X). As the 
poles of the rational function / are outside of X, the operator f{A) is naturally defined as 
f{A) = p{A)q{A)~^ (see Chapter 11) or, equivalently, by the Riesz holomorphic functional 
calculus |RN55j . 

We can now define the central objects of study of this chapter. 

Definitions: 

Let H he a. complex Hilbert space and suppose that A E C{H) is a bounded linear operator 
on H. Let X be a closed set in the complex plane. 

For a fixed constant K > the set X is said to be a iiT-spectral set for A if the spectrum 
<j{A) of A is included in X and the inequality ||/(A)|| < iir||/||x holds for every / G TZ.{X). 

The set X is a spectral set for A if it is a /-C-spectral set with K = 1. 

Facts: 

Facts requiring proof for which no specific reference is given can be found in |RN55[ Chapter 
XI] or |vNe51| . 

1. Von Neumann inequalities for closed disks of the Riemann sphere: 

(a) a closed disk {z £ C : |z — a| < r} is a spectral set for A e C{H) if and only if 

< r. 

(b) the closed set {z € C : jz — a| > r} is spectral for A E C{H) if and only if 

(c) the closed right half-plane Cq = {re(z) > 0} is a spectral set for A if and only 
if re((Av, v)) > for all v G H. More generally, any closed halfplane is spectral 
for A if and only if it contains the numerical range W^(yl). 

2. Suppose that X is a spectral set for A and let / e TZ{a{A)). Then f{X) is spectral for 
f{A). More generally, if (/„) € 7^(X), lim |l/„ - f\\x = 0, and lim ||/„(A) - B\\ - 0, 
then f{X) is a spectral set for B. 

3. Any closed superset of a spectral set is again spectral. Any closed superset of a K- 
spectral set is again if -spectral. 

4. The spectrum a{A) of A e C{H) is the intersection of all spectral sets for A. 
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5. |Will67[ Any spectral set contains a minimal spectral set, i.e., a spectral set having 
no proper closed subset which is spectral. 

6. |vNe51j If the operator A e C{H) is normal, then the spectrum <j{A) is a (minimal) 
spectral set for A. 

7. |Will67j Let ^ be a Hilbert space operator, X a set containing (j{A), and let zq be 
an interior point of X. If < for each rational function / G T^iX) which 
vanishes at Zq, then X is a spectral set for A. 

8. |Pau021 page 18] Let A be a Hilbert space operator and let X be a closed set in the 
complex plane. Let S := TZ{X) + TZ{X) regarded as a subset otC{dX). If X is spectral 
for A, then the functional calculus homomorphism from TZiX) to C{H) defined by 
/ I— ^ /(A) extends to a well defined, positive map (i.e., it sends positive functions to 
positive semidefinite operators) $ oiiS which sends f+'g to f{A)+g{A)* . Conversely, 
if $ is well defined on S, sends / + 5 to f{A) + g{A)* and is a positive map, then X 
is spectral for A. 

9. Spectral sets and matrices with structure : 

(a) A complex matrix ^ € C"^" is normal if and only if its set of eigenvalues <j{A) 
is a spectral set for A. 

(b) The unit circle T is a spectral set for A e C{H) if and only if A is unitary. 

(c) The real axis M is a spectral set for A e C{H) if and only if A is self-adjoint (= 
Hermitian) . 

(d) [Will67] Let ^ be a completely non normal matrix, that is, the triangular factor 
in a Schur decomposition of A is not block diagonal. If j| Aj| = 1 then the closed 
unit disk is a minimal spectral set. 

10. Let s > 0. If ^ belongs to the class Cs, then the closed unit disk is A'-spectral for A. 
|SNF66j We can take i^ = 2s-lifs>l. jOA75j K = max(l, s) is the best possible 
constant. 

11. |Nevl2j Lemniscates as /^T-spectral sets: let p be a polynomial with distinct roots 
and let A G C{H). Let R > satisfy ||p(yl)|| < R and be such that the lemniscate 
{z G C : \p{z)\ = R} contains no critical points of p. Then {z G C : \p{z)\ < R} is a 
iiT-spectral set for A. 

12. |KM66j The closed disk 3D of radius 3 is spectral for every Banach space contraction: 
if A is an operator acting on a complex Banach space E such that ||^||£(£;) < 1, and 
p is a polynomial, then ||p(j4)||£(£;) < sup{|p(2:)| : l^j < 3}. The constant 3 is the best 
possible one. 



Examples: 

1. Suppose that —1 is not in the spectrum of the square matrix A of order n. If the 
right half-plane C^j" = {re(z) > 0} is a spectral set for A then ||/(A)|| < ||/||c+ = 1 

for f{z) = by definition of a spectral set. In order to see that also the converse 
is true, suppose that ||/(^)|| < 1. We set u = (A + /)v, and observe that 

< ||u||2 - ||/(A)uf = ||(A + /)v||2 -\\{A- /)vf = 4re((Av, v)). 

In particular, contains the spectrum of A, and is spectral for A by Fact 
1107.21101 

2. Let A e C{H), and a, /3, 7, 5 e C, aJ-/?7 7^ 0, -5^ ^ a{A). Then both f{z) = ^f±f 
and its inverse are rational functions. Applying twice Fact 1107.2121 we see that X is 
spectral for A if and only it f{X) is spectral for f{A). Thus for a proof of Fact ll07.2lll 
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one only needs to show that ||^j| < 1 imphcs that D is spectral for A, sec for instance 
|H,N551 Section 153]. 

3. Let the matrix A be block diagonal with A — dia.g{Ai , A2) . Then wc have f{A) = 
diag(/(Ai), /(A2)) and hence X is A'-spectral for A if and only if it is /C-spectral for 
Ai and A2. 

4. If the matrix A can be factorized as A = Ci?C"\ then f{A) = Cf{B)C^^. In 
particular, if X is ii'-spcctral for B then it is also A''-spcctral for A with K' < 

K\\C\\\\C-\ 

5. If the matrix A is not diagonalisable, then no finite set X can be /^-spectral for A. 
For a proof, according to Example 1107.2131 and Example 1107. 2141 we may suppose that 
A is a Jordan block of order > 2. Suppose that X — {xi, ...,Xk} is finite. Direct 
computations show that f{A) ^ for f{x) = (a; — xi) ■ ■ ■ (x — Xk), and hence X 
cannot be /^-spectral for A. 

It follows that the spectrum a (A) of a matrix A is A'-spectral if and only if A is 
diagonalisable, and in this case we can take as K the condition number of the matrix 
of eigenvectors. See also Example 1107.5141 

6. From Example 1107.2141 we see that a normal, Hermitian, or unitary matrix A has 
(t(A),K, or T, respectively, as a spectral set. Let us show the converse result claimed 
in Facts I107.2igam07.2l9c1 

If R is spectral for A, then according to Facts [T(J7. 2121 and 1107.2131 the right half plane 
Cd is spectral for iA and for —iA, which together with Fact I107.2llcl implies that 
{Au, u) S M for all u. Hence A is Hermitian. 

If T is spectral for A, then /(T) = M is spectral for f{A), f{z) = if^, according to 

Fact 1107.2121 Hence f{A) is Hermitian, implying that A is unitary. 

Finally, suppose that a{A) = {Ai, A„} is spectral for A. Then, denoting by (.1, 

the Lagrange interpolation polynomials for cr(A), that is, 0/ polynomial of degree 

< n — 1 with tj{\k) = for j 7^ k and = 1 for j = k, wc find that 

n n 

z = ^ Xj£j{z) and hence A = ^ Xj£j{A). 

By Fact 1107.2121 ij{<T{A)) = {0, 1} C M is spectral for ij{A), implying that (jiA) is 
Hermitian. From Example 1107.2151 we know that A is diagonalisable, implying that 
the ij{A) commute, and thus A is normal. 

7. The closure X of the e-pseudospectrum oi A G C{H) is a ii'-spcctral set for K ~ 
^'^''^S^isx) g^^y e > 0, see Fact 16.3.5 in Chapter 16. 

8. [BS67f[Ol75llCro04aj The disk {z € C : |z| < w{A)} is 2-spectral for A. K ^ 2 is 
the best possible constant. This follows from Fact 1107.21101 



107.3 Around the von Neumann Inequality 

According to Fact I107.2llal stated for a = 0, if A is a Hilbert space contraction and / is a 
rational function with poles off the closed unit disk, then < ||/||jj. Since polynomials 

are dense in the disk algebra (that is, the Banach algebra of all complex-valued functions 
which are analytic on D and continuous up to (9D), the previous inequality implies that for 
every contraction on Hilbert space the rational functional calculus extends to a functional 
calculus on the disk algebra. The inequality ||/(^)|| < ||/|Id is known in Operator Theory 
as the von Neumann inequality. The aim of this section is to present several variations and 
generalizations. 
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s: 

1. |Foi57| The von Neumann inequality characterizes Hilbert spaces: if is a complex 
Banach space such that < |b||u holds for every polynomial p and every 
A £ ^{E) with ||A|j£(£;) = 1, then E is isomctrically isomorphic to a Hilbert space 
(i.e. the norm of E comes from an inner product). 

2. |Rov82] Another converse of von Neumann inequality: suppose that /o is holomorphic 
on an open subset G of D and ||/o(A)|| < 1 for every contraction A on a Hilbert 
space, with spectrum contained in G (the operator /o(^) being defined by the Riesz 
holomorphic functional calculus). Then /o is the restriction to G of a holomorphic 
function / defined and bounded by 1 on D. 

3. |KM66j The von Neumann inequality for arbitrary matrix norms: let p be a polynomial 
of degree d with complex coefficients and let ^ be a complex matrix in C"^" with 
ll^ll < 1 for some subordinate matrix norm || • ||. Then ||p(A)|| < (ttti + 1) j| p 

4. The shift as an extremal operator: define the shift on a suitable space of sequences 
by S{zo, zi, ...) = (0, zq, zi, . . .). 

(a) [PPS02[rDix95| An operator-theoretical interpretation of H. Bohr's inequality: let 
r > 0, let i? be a Banach space and let A E C{E) be such that ||v4||£(£;) < r. Then 
lb(^)ll£(£;) < \\p{rS)\\c(ii) for every polynomial p. We have \\p{rS)\\c(n ^ Mo 
if and only if r < 1/3. 

(b) [Pels 11 ICRW781 INik02| Matsaev inequality for some classes of contractions: let 
p be a real number between 1 and oo. Let A : he an isometry on a 
LP space. Then A verifies the Matsaev inequality ||/(A)||£(lp) < ||/(5')||£(fP) 
for every polynomial /. This inequality also holds for contractions on L^ which 
preserve positive functions, or for disjoint contractions {A{f)A{g) = whenever 
fg = 0), or for operators such that ||^||£(li) < 1 and ||A||£(loo') < 1. 

(c) [Drullj Counterexample to a conjecture of Matsaev: let f{z) = 1 + 2z — ^z'^. 
There is a 2 x 2 real matrix A with a real (or complex) Holder 4-norm bounded 
above by 1 but with the Holder 4-norm of f{A) exceeding ||/(5')||£(£4'). 

5. Constrained von Neumann inequalities 

(a) }PY80| Let / and g be two polynomials. Suppose that A is a Hilbert space 
contraction with spectrum included in D such that g{A) — 0. Then ||/(^)|| < 
11/(5* I ker,g(5*))||, where S* is the backward shift, 

S*{zo, Zl,Z2, ■ ■ ■) = (^1, -22, ■ ■ ■): 

which is the adjoint of S acting on i"^ . 

(b) |HdlH92| Let A be a Hilbert space nilpotent contraction with A" = 0, n > 2. 
Then 

wiA) < w(S* I ker (S*"*))) = cos 

n + I 

(c) [BC02| Let n > 2. Let A e C{LI) be a contraction such that A" = 0. Then for 
each s > and each polynomial /, we have Ws{f{A)) < Ws{f{S*^)). Here S* 
denotes the nilpotent Jordan block 

rO 1 ••• 01 



1 
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which is unitarily equivalent to S* \ C" = S* \ kcr S*"* . In particular, for any m 
we have 

w{A"')< cos ^ fc(m,„) 

k[m, n) + 2 m 

Also, if the degree of the polynomial / is at most n — 1, then 



infsup{|/(C)| :CeC,C'"-i = e^«} 



Ws{f{A))<[--l\\\f\\ 



if s e (0, 1], and 



Ws{f{A)) < ||/||: 



inf sup{|/(C)| :CeC,C 



s-1 



if s G [1,2]. 

(d) |BC02| Let A e C{H) he a. nilpotent Hilbert space operator satisfying 

I-A*AhO, I ~2A*A + A*^A'^ hO, 
and A" =0,n>2. Then 

WsifiA)) < w^ifiB:)) 
for all s > and all polynomials /. Here B* is given by the matrix 





Bl = 











Jl 







ri+1 





which is unitarily equivalent to a compression of the Bergman shift, 
(e) [BC02] Suppose A e C{H) satisfies < 1, = and I-2A*A + A*^A^ t 0. 



Then 



w{A) < 



7 9 / 1 

— and w(A'^) < \ — 
24 ^ ^ - V 12 



and these constants are the best possible ones. 

(f ) [BC02| The link between the extremal operator in the constrained von Neumann 
inequalities and the Taylor coefficients of rational functions positive on the unit 
circle T: let F ~ P/Q be a rational function with no principal part and which 
is positive on the unit circle. Then the fcth Taylor coefficient Ck satisfies \ck\ < 
cow{R''), where R = S* \ ker(Q(S'*)). In particular, if P(e'*) = E"=-„+i Cje*^* 

is a positive trigonometric polynomial, n > 2, then \ck\ < cq cos( 7 ) for 

I k 1+^^ 

I < k < n — 1 and, for every distinct numbers k and I among {0, . . . ,7i — 1}, 
there exists 7 G R such that \ck \ + |q| < cow{S^^ + e^'^Sl^). We also have 



1/2 



1/2 



\ck\ + IqI < Co I 1 + cos ^„_ij ^ ^ 



1 + cos ■ 



ra-1 1 
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Examples: 



1. |RN551 page 436] The von Neumann inequality fails for the Holder l-norm in (or 
in £^): consider 



A 



1 

1 



3/5' 

3/5 4i/5 



then II A ll 1 = 1, ll/lln = 1, but ||/(A)||i = ^ > 1. 
2. [Pau021 page 23] The Schwarz-Pick lemma as a consequence of von Neumann inequal- 
ity: Let p be a polynomial such that \\p\\j^ < 1. Let a,b and c be complex numbers 
such that |a| < 1, |c| < 1, a ^ c, and |6p = (1 - |ap)(l - |cp). Consider the 2 x 2 
matrix 

a b 



A^ 







acting on the Euclidean space C^. Then ||yl|| = 1 

p{A) 



p{a) 



Pic) 

and the von Neumann inequality ||p(j4)|| < ||p||jj implies that 



p{a) - p{c) 



< 



1 b(a)|2 1 - |p(c)|^ 



l-|a|2 l-|c|2 • 

Using the identity |1 — uv\'^ = (1 — |mP)(1 — |wp) + \u — v\'^, this can be written as 

p{a) - pic) 



1 -p(c)p(a) 



< 



1 — ca 



107.4 The Multidimensional von Neumann Inequality 

Definitions: 

We say that the multidimensional von Neumann inequality holds for a fixed n-tuple 
of commuting operators A = (Ai, A2, . . . , An) if 

|b(Ai,...,A„)||<|b|l5" 

for every polynomial p in n (commutative) variables. 
Facts: 

We use [Nik02j as a general reference. 
1. The multidimensional von Neumann inequality holds in the following situations: 

(a) [And63| for a pair of commutative Hilbert space contractions (n = 2). 

(b) for a commutative family of isometries. 

(c) for a family of doubly commuting (i.e., AiAj = AjAi for all i and j and A*Aj = 
AjA* whenever i 7^ j) contractions. 

(d) [Bre61j for a commutative family A such that ll^jll^ — 1- 
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2. |AM05| Distinguished varieties: let Ai and A2 be two commuting contractive ma- 
trices, neither of which has eigenvalues of modulus one. Then there is a polynomial 
q S C[2:,u'] such that the algebraic set V = {{z,w) E : q{z,w) = 0} verifies 

^2)11 < I IpI I V for any polynomial p in two variables and yn9(]D)^) ~Vn{dBi)'^ 
(the variety exits the bidisk through the distinguished boundary). 

3. Extremal n-tuples. 

(a) [Dru78[ |Pop99[ Erv98j The Drury-Arveson space and the von Neumann inequal- 
ity of Drury-Popescu-Arveson: let B" = {z € C" : X)j=i kiP < 1} t"*^ ^^e unit 
open ball in C". Let DAn be the Drury-Arveson space of all power series g such 
that ^ 

Q>0 a>0 ' 

where a = (ai, . . . , «„) e N", \a\ = X]j=i '^j ~ nj=i('^i')- The Drury- 

Arveson space DAn can be also regarded as the reproducing kernel Hilbert space 
with kernel 

k,iz) = ——^ =, z,AgB". 

Let Sjg{z) = Zjg(z), where z = (zi, . . . , z„) and g g DAn- Let A = {Ai, . . . , An) 
be a commutative ?i-tuple on a Hilbert space H such that / ^ ^i^j' 
equivalently, 

n 

for all Xj e H. Then 

\\p{A,,...,An)\\ < MSi,...,Sn)\\ 

for every polynomial p. 

(b) }Dru78j A dual version: let S* , j — 1, . . . ,n, be the backward shift operators 
on the Cauchy dual -DA* of the Drury-Arveson space. This means that g = 
J2a>o'^aZ°' with \\g\\l,A>^ = Ea>ol«"l^^ < and a„(S'*.9) a^+s^ig) for 
all a > 0, where Sj = {Sjk)i<k<n- Let A ~ {Ai,...,An) be a commutative 
n-tuple on a Hilbert space H such that / ^ ^j^j^ equivalently, 

for all X e . Then 

MA,,...,An)\\<Msi...,s:)\\ 

for every polynomial p. 

4. |Mla71j Mlak's von Neumann inequality with operator coefficients: let Ck H ^ H 
be bounded linear operators, k = 0, . . . ,n, and A € C{H) be a contraction which 
double-commutes with the C^'s, i.e. ACfc = C^A and AC^ = C'^A for every k. Then 



n 








< sup 




J=0 


NI<1 





71 

<Eii^.i 
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5. |Lub78| Let (^i, . . . , An) be commuting contractions on a Hilbert space H. Then the 
polydisk {(zi, . . . , z„) : \zi\ < ^/n, 1 < j < n} is a spectral set for {Ai, . . . , A„). 

6. |Hir72| The Poisson radius: let A = (Ai, . . . , An) be an n-tuple of commuting oper- 
ators on a Hilbert space H, with spectra included in D. Define 

P,irA,,Q) = re {{{QI + rA,){C,I - rA,)-')) 

for < r < 1, Q Cz T, 1 < j < n, and the Poisson radius rp(A) of A as the supremum 
of r G [0,1) such that 

^ X] ^'^(1) (^"^'^(1) ' C£t(1) ) • • ■ ^'<T(n) {rA^i^n) , Ca{n) ) 

is a positive operator for every C G T", where a runs over all permutations of 
Then < rp{A) < 1 and ||/(rp(A)A)|| < for every polynomial 

f, 

7. |Agl90| The Schur-Agler class: let / be an analytic function of n complex variables 
A = (A^,--- ,A"). Then f{rAi,...,rAn) has norm at most 1 for any r < 1 and 
any collection of n commuting contractions {Ai, . . . , An) on a Hilbert space if and 
only if there are auxiliary Hilbert spaces Hj, 1 < j < n, and an isometry V G 
£(C®7Ji®- • -eiJn) such that, ii H ^ Hi® - ■ ■®Hn, V is written with respect to C®iJ 

asV= ^ ^ , a.nd£x = X^lH,®---y'lHr., then f{X)^A + B£x{lH~D£x)~^C. 

8. |Boz89j Bozejko's von Neumann inequality for non-commuting tuples: let Ak, 1 < 
k < n, he (not necessarily commuting) contractions on H. Let / = /(xi, . . . ,a;„) be 
a polynomial in the noncommutative indeterminates xi , . . . , x„ . Then 

||/(Ai,...,A„)|| < sup{||/(t/i,...,t/„)|| : Uj unitary matrices on C'"''",m G N}. 



Open Problems: 

1. |Dix76| It is not known if for each n there exists a finite constant C„ such that for 
any commuting contractions Ai, . . . , An and any polynomial f in n variables one has 

||/(Ai,...,A„)|| <C„||/||5.. 

It is generally believed that such a constant C„ does not exist. One knows that C„ 
must increase faster than any power of n. 



Examples: 

1. |Var74j The multidimensional von Neumann inequality fails in general for n > 3 and 
matrices Ai, . . . , A„ G C'^'^''. 

2. |Var74j The multidimensional von Neumann inequality can fail with n = 3 and d ~ 5. 
The three matrices Ai, A2, A3 G C^^^ are commuting contractions (with respect to 
the Euclidean norm), and the polynomial p(zi, Z2, ^3) = 2^ + + — 2ziZ2 — 221^3 — 
22223 satisfies \\p\\j3 = 5 and ||/(yli, A2, ^3)!! > 5: 
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3. |CD75| Denote by e^, 1 < j < 8, the vectors of the standard orthonormal basis of 
C^. Let p(zi, Z2, 23) = Z1Z2Z3 — zf — Z2 — z^. There exist three commuting matrices 
Ak e C«''^ k = 1,2,3, such that \\A,\\ < 1 and 1133(^1,^2,^3)1! > 4 > ||p|1t3. The 
contractions Ai are acting on the orthonormal basis as follows: 



Ai 


ei — 


62 — > (-65) — 


^ (-68)- 


0, 


63— 


67— 


^ 0,64 — 


66 — 





A2 


ei — 


63 — > (-eg) — 


^ (-68)- 


0, 


62— 


-> 67 — 


^ 0,64- 


-> 65 — 


-> 


A3 


ei — 


64 — > (-67) — 


^ (-68)- 


0, 


62— 


-> 66 — 


^0,63- 


65 — 






4. |Var74j Given A' > 0, there exist a positive integer n, commuting operators Ai, . . . , A, 
and a polynomial p such that J2]=i ll^jlP — 1 ^^d 

|b(Ai,...,A„)|| > A' sup J |p(zi,...,z„)| : ^|zjf < 1 I . 



107.5 Dilations, Complete Bounds and Similarity Problems 



Definitions: 

For a given closed set X of the complex plane, we say that A G C{H) has a normal c)X- 
dilation if there exist a Hilbert space H containing H and a normal operator N onH with 
cr(iV) C ax so that 

f{A) = F«/(7V) 1^ 

for every rational function / with poles off X. Here Ph is the orthogonal projection of "H 
onto H. 

If X = D, then is a unitary operator and we say that A has a unitary (strong) 
dilation. Notice that in this chapter, contrary to Chapter 18, a unitary (strong) dilation 
A is a common dilation of all powers of A. 

For a fixed s > 0, we say that A £ C{H) has a s-unitary dilation if there exists a 
Hilbert space T-L containing H and a unitary operator U on % such that 

A" = sPfft/" la, in > 1). 

The last two definitions agree in the case s = 1. 

We denote by M„(7?.(X)) the algebra of n by n matrices with entries from Tl{X). 

Considering the (spectral) Holder 2-norm for matrices in C"^", we can endow Mn{Tl{X)) 
with the norm 

II (/y)i<,,,<„ lU = sup{|| (/,,(x)),<^^^.<„ \\:xeX}^ sup{|| (/.,(.t))i<,^^.<„ \\:xe dX}. 

In a similar fashion we endow Af„(£(iJ)) with the norm it inherits by regarding an element 
{Aij)i<:ij<:n in Mn{C{H)) as an operator acting on the direct sum of n copies of H. 
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For a fixed constant K > 0, the set X is said to be a complete /v-spectral set for A if 
cr{A) C X and the inequality |l(/jj(A))i<ij<„|| < Jir||(/y)i<,.j<„||x holds for every matrix 
ifij) e Mn{TZ{X)) and every n. 

A complete spectral set is a eomplete A'-spectral set with K = 1. 

We also say that A is power bounded if sup„ < oo. 

Two Hilbert space operators A and B are said to be similar if there exists an invertible 
operator L such that B ~ L^^AL. 



Facts: 

All the following facts except those with a specific reference can be found in jSNF701 IPieOll 
IPau02) . We denote by A a Hilbert space operator and by AT a closed subset of C. 

1 . |Arv69[ IArv721 IPau84b| A has a normal (?A-dilation if and only if A is a complete 
spectral set for A. 

2. A is completely /•sT-spectral for A if and only if A is completely spectral for an operator 
B e C{H) similar to A, say, B ^ L-^AL, with \\L-^\\ \\L\\ < K. 

3. Each completely spectral set for A is spectral. Conversely, a spectral set for A is 
completely spectral in the following situations: 

(a) if A is a closed disk; 

(b) |Agl85 if A is an annulus; 

(c) [Pau02i Theorem 4.4] if C\ A has only finitely many components and the interior 
of A is simply connected or, more generally, if Ti{X) + TZ(X) is dense in C{dX). 

(d) [AHR08[ IDMC05) There is a closed set A in C having "two holes" and an 
operator A such that X is spectral for A but not completely spectral. |PiclO| 
More generally, such a counterexample exists whenever A is a symmetric domain 
in C with n holes, 2 < n < c». 

(e) |DP86[ IPau02| Let the boundary of the compact A C C consist of n + 1 disjoint 
Jordan curves. If A is a spectral set then it is a complete (2n + l)-spectral set. 

4. Each completely X-spectral set for A € C{H) is i^-spectral. |Pau87| The converse is 
true whenever A is a 2 x 2 matrix. [Pie97| However, there exists an example where B 
is ii'-spectral for A G C{H) but not completely if'-spectral for any K' . 

5. A combination of these facts yields several corollaries which historically came first: 

(a) (Sz.-Nagy dilation theorem) Every Hilbert space contraction has a unitary dila- 
tion. 

(b) (Berger dilation theorem) Every numerical radius contraction has a unitary 2- 
dilation. 

(c) (Paulsen criterion) An operator A is similar to a contraction if and only if it has 
the closed unit disk as a complete if-spectral set. 

6. Operators of class Cs- 

(a) [SNF66[ |0A75| Let s > 0. Every operator of class Cs has a unitary s-dilation. 

(b) Let s > O.IfA is of class Cs, then 11/(^)11 < max{|s-/(z)-f (l-s)-/(0)| : \z\ < 1} 
for every polynomial /. 

(c) Let iJ be a complex Hilbert space of dimension > 2. Then the class C^ increases 
with s: we have Cs C Cs' and Cs Cs' for < s < s' . The set of operators 
acting on H which belong to one of the classes Cs, for some s > 0, is dense in 
the strong operator topology in the set of all power bounded operators. 
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7. Unitary dilations for n-tuplcs: 

(a) [And63j Every pair of commuting contractions on a Hilbert space has a pair of 
commuting unitary dilations. 

(b) |GR69| Every n-tuple A ~ {Ai, . . . , A„) e C{H)" which is cyclic commutative, 
i.e. 

A1A2 ...A„ = AnAi . . . A„_i = . . . = . . . A„Ai, 

has a cyclic commutative dilation to an ri-tuple of unitaries. 

(c) [Ope06| Let G be an acyclic graph on n vertices {1, 2, . . . , 71} (this means that 
it does not contain a cycle as a subgraph). Let A ~ (^1,^2, . . . , A„) be an n- 
tuple of contractions on a Hilbert space that commute according to G, that is 
AiAj = AjAi whenever is an edge of G. Then there exists an 71-tuple U 
of unitaries on a larger Hilbert space that commute according to G such that U 
dilates A. This property may fail if G contains a cycle. 

8. Power boundedness and similarity to a contraction (i.e., D is completely X-spectral) : 

(a) |Rot60[ IHer76[ IVoi74| Rota type theorems: If ^ e C[H) with cr(A) C D, then 
A is similar to a contraction. More general results are true for some closed sets 
X C C and for operators A with a {A) C X. 

(b) [Bad03| A Banach space Rota theorem and Matsaev inequality: Let Ei be a 
Banach space and suppose that A S C{Ei) has ct{A) C D. Then, for every p > 1, 
there exists a Banach space E2 which is a quotient of the set £p{Ei) of -Ei-valued 
sequences and an isomorphism L : E2 ^ Ei such that, ii B = L^^AL G £(i?2), 
then 

\\fm\c(E.) < \\fiS)\\cieHE,)) 
for each polynomial / and. even more generally, 

ll(/jj(-S))l<ij<n||£(_EJ) < ||(/jj(S'))l<ij<n||£(£P(_Ei)") 

for all matrices of polynomials. 

(c) If A is compact and power bounded, then A is similar to a contraction. 

(d) [Fog64[ lLeb68[ [Pie97] There is a Hilbert space operator A which is power bounded 
but for which the closed unit disk is not /C-spectral for any K, and thus A is 
not similar to a contraction. There is a Hilbert space operator A which is not 
similar to a contraction but for which the closed unit disk is A'-spectral for A, 
for some K. 

(e) |Pel821 IBou86j Let A e C{H) with M = sup„ < 00 and let p be a poly- 
nomial of degree d> 2. Then ||p(A)|j < (logc?) || p A similar result holds 
for norms of matrices of polynomials, and the log d term in the inequality is the 
best one may hope for. 

(f) [Bou86| Bourgain's estimate for matrices similar to contractions: If A is a ma- 
trix such that, with the (spectral) Holder 2-norm, ||/(74)|| < C||/||jj for any 
polynomial /, then there is an invertible matrix L such that |iL~^AL|| < 1 and 

ll^ll < KC'^\og{n + 1), where is a numerical constant independent of 

n. 

(g) [BePr98j Suppose that D is K-spectial for A e C{H). Then there exist Hilbert 
spaces Hi,H2, contractions Ai e C{Hi), A2 E -^(^2) and injective linear oper- 
ators Xi : Hi H,X2 : H H2 with dense ranges such that XiAi = AXi 
and A2X2 = X2A. See also Example fT07. 5151 
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Open Problems: 

1. |SNF70j What is the obstruction to an n-tuplc of commuting Hilbcrt space contrac- 
tions having commuting unitary dilation? 



Examples 
1 



The (unilateral) backward shift operator S* G C{1'^) defined by {S*x)j = Xj+i, 
j G N, is easily seen to be a contraction, with spectrum (j{S*) = D. Hence X = D is 
a (completely) spectral set for S* . For quadratic summable sequences x indexed by 
Z, one defines the bilateral backward shift operator B through {Bx)j = Xj+i, j £ Z, 
which is clearly unitary. Since {B"x)j = Xj+n, j € Z, n > 1, we see that i? is a 
1-unitary dilation and thus a unitary dilation of S* . 

The notion of (weak or strong) unitary dilation is a nice illustration of why it is 
important in linear algebra to have sometimes recourse to infinite dimensions: we 

\A *] 

say that A e C"X" is imbedded in B G C™^'" if i? = . Given a contraction 

A G C"^", is there a unitary B £ C"x™ such that p{A) is imbedded in p{B) for any 
polynomial p of degree at most fc? This property, related with the exactness property 
for Krylov spaces, reduces to (strong) unitary dilations for k = oo considered above, 
and for fc = 1 to the (weak) unitary dilation considered in Section 18.6. The latter 
problem has the solution |Hal50j 



B 



A {I-AA*y/^' 
(/-^*A)i/2 -A* 



"0 0' 




'0 





,^3 = 


'0 0" 


/ 


u 





V 



Egervary |Ege54| showed that such an imbedding is always possible with m = (fc+l)ri: 
for example, one may imbed the n-dimensional shift in an unitary circulant of order 
m = {k + l)n. It is also known that in general such an imbedding for finite m is 
impossible for fc = oo, see also |LS13|lMS13j for more modern aspects of this question. 
|Par70| Let U and V be two contractions in C{H) such that U is unitary and UV ^ 
VU. We define three commuting contractions in C{H © H) by defining 

Ai 

Then the commuting triplet A = (Ai, A2, A3) verify the multidimensional von Neu- 
mann inequality but do not possess a commuting triplet of (strong) unitary dilations. 
Let A E C"^" be diagonalizable. Then A is power bounded if and only if a (A) C D, 
and in this case A is similar to the contraction given by the Jordan canonical form. 
Here D is (completely) A'-spectral, with K the condition number of the matrix of 
eigenvectors. 

If A G C"^" is not diagonalizable, then (t{A) C D still implies (through, e.g., the 
Cauchy integral formula and the pseudo-spectrum) that D is completely /•sT-spcctral 
for some K. However, in general, the Jordan canonical form is no longer a contraction. 
|MT07j Let T e C{H) and S G C{K) be Hilbert space operators. We say that T is a 
quasiaffine transform of 5" if there exists an injective operator A : H 1-^ K with dense 
range such that AT = SA. We say that T is quasisimilar to S if each operator is a 
quasiaffine transform of the other. 

There exists a power bounded operator on a Hilbert space which is not quasisimilar 
to a contraction. 

|Pie97[ IDP971 IBad031 IBPOli IRic02j Let a = (ao, ai, . . . ) be a sequence in i'^ and set 



R{a) 



S* 




Yia) 
S 



c{e{H)®e{H)), 
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where S is the shift on the Hilbert space l'^{H) of iJ- valued square summable se- 



quences, H is of infinite dimension, and Y{a) 



y-i+j Ci+j 



i,i>o 



where the Cw's are 



operators verifying the canonical anticommutation relations CiCj + CjCi = and 
CiC* 



Then the operator R{a) is polynomially bounded if and only if suP;,>q(A;-|-1)^ X)i>fc lo^jP 
is finite, and R{a) is similar to a contraction if and only if J2 k>oi^ ~^ finite. 



107.6 Intersections of Spectral and JC-Spectral Sets 

In this part we will discuss intersections of spectral sets, including the annulus problem 
of Shields |Shi74j . It is known that the intersection of two spectral sets is not necessarily 
a spectral set, see |Will67[ IMis84[ IPau02| and Example 1107.6111 below. However, the same 
question for iiT-spectral sets remains open, though the above examples indicate that one 
may not use the same constant. We refer to |Pau88| and the book jPau02] for modern 
surveys of known properties of if-spectral and complete if-spectral sets. 





Facts: 

1. |DP861 lPau02[ If two i^-spectral sets have disjoint boundaries, then their intersection 
is a if '-spectral set for some K' . 

2. |Sta86a] The intersection of a simply connected spectral set X oi A e ^{H) whose 
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interior has finitely many components with the closure a simply connected open set 
G containing the spectrum of A is a A''-spcctral set for A for some K'. 
Weaker versions of this statement concerning the connectivity of X and/or G have 
been given in |Sta86b| ILew90| . 

3. [Lew90j The intersection of a (complete) ii'-spectral set for a bounded linear operator 
A with the closure of any open set containing the spectrum of A is a (complete) K'- 
spectral set for A for some K' . 

4. |BBC09| Let A e C{H), and consider the intersection X = Di n D2 n ■ ■ ■ n of 
n disks of the Riemann sphere C, each of them being spectral for A. Then X is a 
complete if-spectral set for A, with a constant K < n + n(n— 1)/\/3- 

(a) [Cro07j If in this result we add the requirement that the disks Dj and thus X 
are convex, then X is a complete 11.08-spectral set for A. 

(b) For n = 2 we obtain the constant K = 2 + 2/ \/3 for various configurations as 
shown in Figure [lO 7. 1[ in particular for a strip/sector obtained by the intersec- 
tion of two half-planes and discussed in Example I107.7l3a1 or the lens-shaped 
intersection of two disks |BC06) . 

5. |Shi74j For R > 1, consider the annulus X = X{R) ^ {z e C : R'^ < \z\ < R}, 
and denote by K{R) (and by Kcb{R) > K{R), respectively), the smallest constant K 
such that X is a iC-spectral set (and a complete X-spectral set, respectively) for any 
invertible A G C{H) verifying ||A|| < R and \\A-^\\ < R. Then 



(a) [ShiZl] KiR) < 2 ^ y 

(b) [BBCOQj Kcb{R) < 2 ^ — 



< 2 + The first upper bound is sharper 



than I107.6l5al for R < 3.1528; 

(c) |BBC09j K{R) > |. 

(d) |BBC09| Kcb{R) < max{3,2 + E^i2h|+t}: being sharper than I107.6l5al for 
R > 1.8544, and sharper than I107.6l5bl whenever R > 1.9879. It follows in 



particular that K{R) < 3 for R > 2.0953. 



(e) mmKcbiR)<2 + ^J^ 



dd, being always sharper than 1107. 6l5a[ 



sharper than I107.6l5b1 for R > 1.6405, and sharper than I107.6l5d1 whenever 
R < 2.0462. 



Examples: 

Consider the matrix 



A 



j = R 



1 

i?' 



R> 1, 



l/z which verifies ||/(A)||/|l/|U(i?) 



with ~ \\A ^11 = i? such that both sets {|z| < R} and {|z| > 1/R} are spectral for A, 
and consider the intersection X{R) = {z G C : R^^ 

1. |Mis84| The example of the function f{z) = 
2^5^ shows that X{R) is not a spectral set for R > 

2. We get a sharper statement for the function f{z) ~ g{z) — g{l/z), g[z) = R^rr^, 
leading to 

|l/(A)||-2, ll/IUw = 1^ j^2 + < 3- 

Thus X{R) is even not |-spectral for A for any R> 1. Compared with Fact 1107. 6l5cl 
we thus have shown the improved lower bound K{R) > '^j+w-y^ > |- 
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3. Let the boundary of the compact X C C consists of n + 1 disjoint analytic Jordan 
curves. Then X is iC-spectral for the above matrix A if and only if |Mis84| 

K > 7r(X), T{X) = sup\ ^{}P^ : f analytic on the interior of X, /(I) = o}. 

For the annulus X{R) = {z g C : R^^ < \z\ < R} of the preceding example, the 
quantity T{X{Rj) is computable jBBC09| . leading to the lower bound of Fact n~07.6l5cl 
Moreover, given r <S (1, i?), using the Schwarz lemma we also conclude that both sets 
{z : \z\ < r} and {z : \z\ > 1/r} are if-spectral for A with K ={R- R~^)/{r - r'^) 
but not their intersection X{r), at least for those r not too far from R. 



107.7 The Numerical Range as a 1^-Spectral Set 

Recall that the numerical range is given by :~ {{Ax,x) : = 1}. It has been 

conjectured jFW72j that the closure of the numerical range is a complete iiT-spectral set 
for A. This was proved in jDD99| . Here we report about recent results along these lines, 
including the existence of a universal such constant K shown by Crouzeix |Cro07| . For 
applications in numerical linear algebra of results of this type see for instance the discussions 
in [Ele93l[Gre97llTT99j . 

Facts: 

1. Estimates depending on the shape oi X: For every bounded linear operator A e C{H), 
every compact convex set X containing the numerical range is a complete A'-spectral 
set for A, where 

(a) [DD99| K = 3 + (27rdiameter(X)7area(X))3. 

(b) [PS05] K = l+2/(l-g(X)), with q{X) e [0, 1] being C. Neumann's configuration 
constant of X (the oscillation norm of the underlying Neumann-Poincare singular 
integral operator |Kra80| ) . We have q{X) = 1 if and only if X is a triangle or a 
quadrilateral. 

(c) |BCD06( Theorem 2.3] K = 2 + tt + Ty(log(r)), if the boundary of X is 
parametrized by [0, 2-K]3t^uj + r(t)e'*, r{t) > 0. 

2. Universal estimates: 

(a) [CroOTi Theorem 1] There exists a universal constant K = Kcrouzeix € [2, 11.08] 
such that for every bounded linear operator A S C{H), the closure of the nu- 
merical range is a complete iiT-spectral set for A. 

(b) [Cro04a| Theorem 1.1] is completely 2-spectral for every 2x2 matrix A. 

3. Mapping theorems for the numerical range: let A G C{H), and let / be analytic on 
D, and continuous up to 9B. 

(a) [BS67] w{A) < 1 if and only if for all z e C 

\\A~zi\\ < 1 + VTTFF = 



2 




zl 



(b) [BS67] If wiA) < 1, ||/||d < 1, and /(O) = then wif{A)) < 1. 



107-18 



Handbook of Linear Algebra 



(c) [DruOSj If w{A) < 1, ||/||d < 1, and |/(0)| < 1, then W{A) is a subset of the 
convex huU of the union of the disks B and {z € C : \z - /(0)| < 1 - |/(0)p}. 
Furthermore, ||/(A)|| < i^(|/(0)|), where 

i^ia) = (2 - + 2a^ + 2(1 - a^){l - + a")^/^) . 

(d) jPruOSj liw{A) < 1, then w{f{A)) < and | is the best possible constant. 

(e) [Kat65j If r is a rational function with r(oo) = 00, and if X = {z G C : \r{z)\ < 1} 
is a convex set containing then 'w{r{A)) < 1. 

Open Problems: 

1. |Cro07| Is Kcrouzeix = 2? At least if we restrict ourselves to 3 x 3 matrices? 



Examples: 

1. |Cla84j The spectrum of the Toeplitz operator acting on the Hardy space with 
symbol (f>{w) = aw + bw~^, < 6 < a, is a convex set whose boundary is an ellipse 
with semi-axes a ± 6. The spectrum coincides with the closure of the numerical range 
W{T^). This set is not spectral but il'-spectral for with K = + \b/a\'^. 
Other classes of symbols with similar properties are given in [Cla84j . 

2. |PS05j The configuration constant of Fact I107.7llbl can be estimated for (smooth) 
X: q{X) < 1 — — for dX an ellipse with eccentricity e < 1 and q{X) < 
1 — loiigth(ax) ^j^j^ niaximum radius of curvature. 

ZTZri ' 

3. X-spectral sets containing the numerical range: let A e C{H), and let X C C be 
closed and convex containing Ty(^). 

(a) [CD03[ IBCD06[ IBC07| If X is a convex sector or a strip, then X is (2 + ^)- 
spectral for A. 

(b) [Cro04b[ IBC07| If the boundary of X is a parabola or a hyperbola, then X is 
(2 -I- -^)-spectral for A. 

(c) [BC07[ Theorem 1] If the boundary of X is an ellipse with eccentricity e < 1, 
then X is {2 + )-spectral for A. 

(d) For an equilateral triangle, we have K = 2+7r+6 log(2) according to Fact ll07.7llc[ 
and for a square if = 2 -f- tt + 4 log(2). 



|Crol2b| For the 3x3 matrix 



A^ 



2 " 

e (l-e2)/^/2 
(l-e^)/^ 



and sufficiently small e > 0, the numerical range M^(A) is 2-spectral for A but not 
completely 2-spectral for A. 

|GC12j Crouzeix's conjecture is known to hold for generalized Jordan blocks where 
one replaces in a Jordan block of arbitrary size the lower left entry by an arbitrary 
scalar. 

For /(z) = z we recover from Fact 1107.7121 the well-known link between numerical 
radius and euchdean norm, namely [1,2] 9 [[^[[/^(yl) ~ ||/(A)jl/ sup,g^y(^) l/(-z)l ^ 

Kcr 



^rouzeix ■ 
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7. Consider A 



disk and, for f{z) = z, 



/(o) 2/'(oy 

, /(O). 



Here W(^) is the closed unit 



\\fiA)\\ / sup |/(z)| = 2 < A'croHzeio;. 



8. |Cro04a| Section 2] Consider A 



••2x2 



with p > 1. According to Fact 



18.1.7 in Chapter 18, is compact, with its boundary given by a ellipse with foci 

±1 and minor axis p — l/p. The matrix can be diagonalized as 



A^LBL-^, B = 



1 

-1 



Thus 



1/(^)11 



sup |/(z)| 

z£W(A) 



L = 



\.fiB)\\ 



1 



_£-l/p- 
P+I/P 

2 



ILII Hi- 



\A\\ 



sup |/(z)| 

z&W(A) 



max{|/(-l)|, 1/(1)1} 
sup \f{z)\ 

zeW[A) 



By |Cro04a[ Theorem 2.1] and some elementary calculus, the right-hand side of this 
expression is maximized for the function /q mapping conformally VF(yl) on the closed 
unit disk, with /o(0) = 0, /i;;(0) > 0. Thus W{A) is a iT-spectral set for A with 
optimal constant K = p fn{^) = ||/o(^)|| G (1:2), the last two relations following 
from the fact that an explicit formula for /o is known jCro04al Eqn. (2.2)]. 



107.8 Applications to the Approximate Computation 
of Matrix Functions 



Facts: 

Notation: A e C"^", E C C compact convex, being /C(E)-spectral for A, f a function 
being analytic on E. 

1. Polynomial approximation through Taylor sums. 

A popular method |Hig08[ Section 4.3] for approximately computing (entire) functions 
of matrices is to approach the Taylor series f{z) — J2jLo ^j^'' mth partial sum 

Sm{z) = ^"Lo^jZ-', with the error estimate }Mat93[ Cor. 2] 

\\if - S„.)iA)\\ < ^ ....^^^^^^ 

Here the right-hand side can by bounded in terms of E by the techniques of the 
preceding subsections, see, e.g.. Example 1107.2181 or Fact 1107.7121 

2. Polynomial approximation through Faber sums. 

Instead of Taylor sums, following |Hig08[ Section 4.4.1] and Section 11.7 one may also 
consider best polynomial approximants Pm of / on E, leading to the error estimate 

||(/-p„0(A)|| < A'(E)p™(/,E), p™(/,E)= min max |/(z) - p(z)|. 

dcgp<m zGK 

According to }KP67| Theorem 4], this rate pm{f,E) is achieved up to some factor 
a log(m) -I- /3 with explicit a, /3 > not depending on / nor on E by taking as pm the 
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mth partial Fabcr scries, which is defined as follows: let (j) = Tp~^ map conformally 
the exterior of E onto the exterior of the closed unit disk D, then the mth Faber 
polynomial Fm is defined as the polynomial part of the Laurent expansion of 0™ at 
c», and / has the Faber series 

flf^FA^l U-^f ^^dw, 

absolutely converging to / uniformly in E [KP67[ Theorem 5] . Notice that for E a disk 
centered at we recover Taylor series, and for E = [—1,1] Chebyshev orthogonal series 
|Trel2j . By [BR091 Section 3] one has the a posteriori bound |/m+i| < Pm(/, E) < 
2(|/m+i| + |/m+2|+- ■ •)• Estimates for the Faber coefficients of the exponential function 
can be found in jBR09[ Section 4]. 
The Faber operator |Gai87| 

oo J. 

J^{F){z) = ^ F{w) = y + E -f^^' 

j=o j=i 

for functions F analytic on D maps polynomials of degree m to polynomials of de- 
gree m, in particular J^{w"^){z) = Fm{z) for ?ti > 1. According to pccOS] . |BR09I 
Theorem 2.1] 

<2max|^^H|, 

allowing to relate best polynomial approximation of / on E to p„i(J^^^(/),lD)), and 
to derive error estimates for matrix functions which do not involve A'(E), e.g., for the 
mth partial Faber sum [BR09[ Theorem 3.2] 

oo 

\\{f-p,n){A)\\<2 |/,|. 

j=m+l 

3. Polynomial Arnoldi method. 

The Arnoldi process is a popular method for approaching f{A)b for some fixed vector 
b and large sparse A [Hig08[ Section 13.2]. The mth approximant is given by the 
projection formula Vmf{V^AVm)V^b, where Vm £ £-nxm columns spanning an 
orthonormal basis of the Krylov subspace Span{6, Ab, . . . , A"^~^b}. Typically, m n, 
and hence f{V*^AVm) can be computed by some direct method. According to |Hig08[ 
Lemma 13.4], this method is exact for / a polynomial of degree < m — 1, and hence 

\\f{A)b-V„J{V*,AV„,)V*M ^,u, ^<A^^\^^uf ut/* ^|| 
em := p < + \\{! - p)(y.^AY„,)\\ 

for any polynomial p of degree < m — 1. Supposing that C E (and thus 

Wiy^AVm) C E), we obtain by means of the techniques of 1107.812] that e™ < 
or Crn < 4, pm-i{T ^(/),]D)) < 4^°^^ \fj\, scc jBR09l Propo- 
sition 3.1 and Theorem 3.2]. 

4. Rational approximants with free poles via (Faber- )Pade approximants. 

Another popular approach (see |Hig08[ Section 4.4.2] and Section 11.7) for approach- 
ing matrix functions for functions / with singularities is to replace / by the rational 
function p/q with p, q polynomials of degree at most fc, and m, respectively, k > m— 1, 
such that the first A; -I- m -I- 1 terms in the Taylor expansion oi f — p/q at zero van- 
ish. Here one hopes that the poles of this [k\m] Pade approximant p/q do mimic the 
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singularities of /. Though this approach is also applied for entire functions like the 
exponential function |Hig08[ Section 10.7.4], the error is best understood for Markov 
functions 

j{z)~c+ I , c G M, /i some positive measure, 

where we suppose that /? < —w[A). This includes (up to some variable transforma- 
tions) functions like log(z), l/^/z or more generally pth roots, sign(z), tanh(2:), and 
others |BGM81[ IBR09] . By |ST921 Lemma 6.2.1], the denominator q has all its roots 
in [a, /3], and the error may be represented as 



q q{zY 



pm+n+l 



Then the error on the disk {z e C : [zj < w{A)} is minimal for z = — w(A), and, 
using Example [lOLlHl 

\\{f-P-){A)\\<2\{f-^){-w{A))\. 

q q 

In general, sharper error bounds are obtained for fc > m by combining the above 
techniques with those of 1107.81^ where we suppose in addition that E is symmetric 
with respect to the real axis. We first notice that, with /, also T~^{f) is a Markov 
function |BR09[ Theorem 6.1(a)]. Denoting by P/Q the [k\m] Pade approximant of 
T~^{f), the function F(P/Q) is rational with numerator degree < k and denominator 
degree < m, called the [k\m\ Faber-Pade approximant |E1183j . For E = [—1,1] one 
recovers the so-called non-linear Chebyshev-Pade approximant [Sue09| . As above, we 
may bound for Markov functions the error through 

ll(/-^(^))(A)||<2|(^-(/)-^)(-l)|. 

5. Rational approximation with prescribed poles and rational Arnoldi. 

There exists a variant of the polynomial Arnoldi method 1107.8131 where the columns 
of Qm G C"^"* give an orthonormal basis of the rational Krylov subspace 

g(^)-iSpan{5, A6, . . . , ^""^6} 

for some fixed polynomial q{z) = Y\j{z — zj) of degree < m — 1 (and hence for q = 1 
we recover the polynomial Arnoldi method). The computation of V„i and V^AVm by 
some rational variant of the Arnoldi process jBR09j requires {A — Zjl)~^a for some 
vectors a, and this task of solving shifted linear systems is particularly trackable if q 
has a small number of multiple poles. As before we obtain 

"•^(^)^-^-(^^"'^^-)^-^" < IK/ - ^)(A)|| + U.f-'-)iv:AVr..)\\ 
ml q q 

for any polynomial p of degree < m — 1 [BR09[ Theorem 5.2]. Let M^(A) C E, and 
suppose that E is symmetric with respect to the real axis, and zj ^ E. For estimating 
the error we are left with the task of approaching / on E by a rational function with 
fixed denominator p/q. Notice that p/q = J- (P/Q) with P a polynomial of degree at 
most < m - 1, and Q{w) = JJ^iw - (p{zj)) [Ellgg] . Hence e„ < 4p^^_i(J"-i(/),D), 
where 

p1_i(/,IE)=^ niin max |/(z) - -(z)|, 

dcgp<?n— 1 z^E q 
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see [BR09[ Theorem 5.2]. For Markov functions / as in 1107. 8l4l lower and upper 
bounds for p^_j^(J^~^(/),D) arc given in |BR091 Theorem 6.2], in particular the 
simple explicit bound 

1 I /•/ \ M ItT ^ ~ 



P,?,_i(-F"i(/),D) < -— maxl/(z) - /(oo)] max [Q 7 

\<P(P)\ zelE t«e[0(a),0(/3)]l-^.-^ 1 



W(j){zj) 



Poles Zj minimizing the right-hand should therefore be in [a, /3], the set of singularities 
of /, and various configurations of poles minimizing the right-hand side have been 
considered in |BR09[ Section 6]. 

Error bounds for GMRES and FOM for solving systems of linear equations. 

Both GMRES and FOM are iterative Krylov subspace methods for solving systems 
Ax = b with A large and sparse, see Chapter 41. Here we may apply the techniques 
of the preceding sections for the Markov function /(z) = 1/z provided that ^ E. E 
containing W{A). The residual of the mth iterate x^^^'^^ of GMRES with starting 
residual r = b- Ax'^^^^-^^ satisfy |Bec05| 

. USlL< . 2 ^ ^ 2 + 1/10(0)1 



\\r\\ degp<rn 1p(0)1 llrll - ^'1F™(0)1^- 1(/)(0)1™ 

with Fm the mth Faber polynomial of E, and (f> mapping conformally the exterior of 
E onto the exterior of the unit disk. The asymptotic convergence factor 1/10(0)] < 1 
can be computed for various shapes of E. For instance |Bec05[ IBGTOGj . for positive 
definite A + A* considering the lens E = {z e C : re(z) > dist(0, W{A)), \z\ < w{A)} 
we get 1/10(0)1 = 2sin(/3/(4 - 2/3/7r)) < sin(/3), with the angle ^ G (0,7r/2) being 

defined by cos(/3) = ^^^^i"'^^^^^ ■ 

The mth iterate of FOM (with starting vector Xq^^ is a special case of the polynomial 
Arnoldi methodHUTMl namely x™*^ = Vm{V;;,AV^)-^V^b or f{z) = 1/z, and thus 



FOM 



<47?™_i(^-^(/),D) < 



410(0)]-" 
dist(0,E) 
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